An investigation of the use of dynamic time warping for word spotting and connected speech recognition
نویسندگان
چکیده
Several variations on algorithms for dynamic time warping have been proposed for speech processing applications. In this paper two general algorithms that have been proposed for word spotting and connected word recognition are studied. These algorithms are called the fixed range method and the local minimum method. The characteristics and properties of these algorithms are discussed. It is shown that, in several simple performance evaluations, the local minimum method performed considerably better then the fixed range method. Explanations of this behavior are given and an optimized method of applying the local minimum algorithm to word spotting and connected word recognition is described.
منابع مشابه
Robot Arm Performing Writing through Speech Recognition Using Dynamic Time Warping Algorithm
This paper aims to develop a writing robot by recognizing the speech signal from the user. The robot arm constructed mainly for the disabled people who can’t perform writing on their own. Here, dynamic time warping (DTW) algorithm is used to recognize the speech signal from the user. The action performed by the robot arm in the environment is done by reducing the redundancy which frequently fac...
متن کاملTitle Connected Spoken Digit Recognition by Augmented Continuous DP Matching and its Evaluation
Recently, we proposed the Augmented Continuous dynamic time warping algorithm for connected spoken word recognition. The algorithm is based on the same principle as the Two Level DP and Level Building DP. Our algorithm obtains a near optimal solution for the recognition principle based on pattern matching. However, it is computationally more efficient than the conventional methods and does not ...
متن کاملSpoken Digit Recognition by AugmentedContinuous DP Matching and its Evaluation
Recently, we proposed the Augmented Continuous dynamic time warping algorithm for connected spoken word recognition. The algorithm is based on the same principle as the Two Level DP and Level Building DP. Our algorithm obtains a near optimal solution for the recognition principle based on pattern matching. However, it is computationally more efficient than the conventional methods and does not ...
متن کاملSpeech Recognition for Keyword Spotting using a Set of Modulation Based Features – Preliminary Results
We present the preliminary results of applying a set of parameters of the AM-FM model for recognizing word utterances. By acquiring modulation based parameters from the amplitude envelope (AE) and the instantaneous frequency – both obtained by demodulating at four selected center frequencies – a compact feature set is created for each frame of a word utterance. Applying a dynamic time warping o...
متن کاملAn Utterance Recognition Technique for Keyword Spotting by Fusion of Bark Energy and MFCC Features
This paper describes the preliminary results of a keyword spotting system using a fusion of spectral and cepstral features. Spectral energy in 16 bands of frequencies on Bark scale and 16 mel-scale warped cepstral coefficients are used independently and in combination with appropriate weights for recognizing word utterances. Results of matching features using Euclidean and cosine distances in a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1980